Search CORE

19 research outputs found

Protein interactions across and between eukaryotic kingdoms: networks, inference strategies, integration of functional data and evolutionary dynamics

Author: Pevzner Samuel J
Publication venue: Boston University
Publication date: 01/01/2013
Field of study

Thesis (Ph.D.)--Boston UniversityHow cellular elements coordinate their function is a fundamental question in biology. A crucial step towards understanding cellular systems is the mapping of physical interactions between protein, DNA, RNA and other macromolecules or metabolites. Genome-scale technologies have yielded protein-protein interaction networks for several eukaryotic species and have provided insight into biological processes and evolution, but many of the currently available networks are biased. Towards a true human protein-protein interaction network, we examined literature-based aggregations of lowthroughput experiments, high-throughput experimental networks validated using different strategies, and predicted interaction networks to infer how the underlying interactome may differ from current maps. Using systematically mapped interactome networks, which appear to be the least biased, we explored the functional organization of Arabidopsis thaliana and characterize the asymmetric divergence of duplicated paralogous proteins through their interaction profiles. To further dissect the relationship between interactions and function enforced by evolution, we investigated a first-of-its-kind systematic crossspecies human-yeast hybrid interactome network. Although the cross-species network is topologically similar to conventional intra-species networks, we found signatures of dynamic changes in interaction propensities due to countervailing evolutionary forces. Collectively, these analyses of human, plant and yeast interactome networks bridge separate experiments to characterize bias, function and evolution across eukaryotic kingdoms

Boston University Institutional Repository (OpenBU)

Tandem mass spectrometry data quality assessment by self-convolution

Author: A Shevchenko
AA Bharath
AL McCormack
AL McCormack
Andrew Keller
Bin Ma
BJ Cargile
C Yu
CG Herbert
D Fenyo
DC Barbacci
DL Tabb
DN Perkins
F Desiere
HI Field
JE Elias
JE Syka
Jimmy K Eng
JK Eng
JV Puymbrouck
K Biemann
K Biemann
Keng Wah Choo
KR Clauser
LY Geer
M Kinter
M Mann
Marshall Bern
N Zhang
P Roepstorff
PA Pevzner
Purvine Samuel
RA Zubarev
Randy J Arnold
Richard S Johnson
RS Johnson
S Sunyaev
Salmi Jussi
VH Wysocki
Wai Mun Tham
Wu Fang-Xiang
Wu Yik-Chung
Z Zhang
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Many algorithms have been developed for deciphering the tandem mass spectrometry (MS) data sets. They can be essentially clustered into two classes. The first performs searches on theoretical mass spectrum database, while the second based itself on <it>de novo </it>sequencing from raw mass spectrometry data. It was noted that the quality of mass spectra affects significantly the protein identification processes in both instances. This prompted the authors to explore ways to measure the quality of MS data sets before subjecting them to the protein identification algorithms, thus allowing for more meaningful searches and increased confidence level of proteins identified. Results The proposed method measures the qualities of MS data sets based on the symmetric property of b- and y-ion peaks present in a MS spectrum. Self-convolution on MS data and its time-reversal copy was employed. Due to the symmetric nature of b-ions and y-ions peaks, the self-convolution result of a good spectrum would produce a highest mid point intensity peak. To reduce processing time, self-convolution was achieved using Fast Fourier Transform and its inverse transform, followed by the removal of the "DC" (Direct Current) component and the normalisation of the data set. The quality score was defined as the ratio of the intensity at the mid point to the remaining peaks of the convolution result. The method was validated using both theoretical mass spectra, with various permutations, and several real MS data sets. The results were encouraging, revealing a high percentage of positive prediction rates for spectra with good quality scores. Conclusion We have demonstrated in this work a method for determining the quality of tandem MS data set. By pre-determining the quality of tandem MS data before subjecting them to protein identification algorithms, spurious protein predictions due to poor tandem MS data are avoided, giving scientists greater confidence in the predicted results. We conclude that the algorithm performs well and could potentially be used as a pre-processing for all mass spectrometry based protein identification tools.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Network-based Analysis of Genome Wide Association Data Provides Novel Candidate Genes for Lipid and Lipoprotein Traits

Author: Barabasi Albert-Laszlo
Eriksson Per
Folkersen Lasse
Gulbahce Natali
Ladenvall Claes
Menche Joerg
Orho-Melander Marju
Pevzner Samuel J.
Sharma Amitabh
Publication venue: 'American Society for Biochemistry & Molecular Biology (ASBMB)'
Publication date: 01/01/2013
Field of study

Genome wide association studies (GWAS) identify susceptibility loci for complex traits, but do not identify particular genes of interest. Integration of functional and network information may help in overcoming this limitation and identifying new susceptibility loci. Using GWAS and comorbidity data, we present a network-based approach to predict candidate genes for lipid and lipoprotein traits. We apply a prediction pipeline incorporating interactome, co-expression, and comorbidity data to Global Lipids Genetics Consortium (GLGC) GWAS for four traits of interest, identifying phenotypically coherent modules. These modules provide insights regarding gene involvement in complex phenotypes with multiple susceptibility alleles and low effect sizes. To experimentally test our predictions, we selected four candidate genes and genotyped representative SNPs in the Malmo Diet and Cancer Cardiovascular Cohort. We found significant associations with LDL-C and total-cholesterol levels for a synonymous SNP (rs234706) in the cystathionine beta-synthase (CBS) gene (p = 1 x 10(-5) and adjusted-p = 0.013, respectively). Further, liver samples taken from 206 patients revealed that patients with the minor allele of rs234706 had significant dysregulation of CBS (p = 0.04). Despite the known biological role of CBS in lipid metabolism, SNPs within the locus have not yet been identified in GWAS of lipoprotein traits. Thus, the GWAS-based Comorbidity Module (GCM) approach identifies candidate genes missed by GWAS studies, serving as a broadly applicable tool for the investigation of other complex disease phenotypes

Lund University Publications

PubMed Central

Preliminary Evaluation of Gamification in Residency Training

Author: Auffermann
Burke
Chan
Cheston
Domínguez
Edwin F. Donnelly
Hawkins
Kalia
Kalia
Kapp
Karthik M. Sundaram
Kelly
Lamb
Meaghan Magarik
Patel
Patrick Couture
Ranginwala
Ranschaert
Reed A. Omary
Samuel J. Pevzner
Seidel
Shah
Tso
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Automated Genome Mining of Ribosomal Peptide Natural Products

Author: Arnison P. G.
Bandeira N.
Begley M.
Bradley S. Moore
Duncan M. W.
Eng J. K.
Frank A. M.
Frank A. M.
Heather M. Brewer
Heel A. J.
Hosein Mohimani
Kelly R.
Kersten R. D.
Kim S.
Kodani S.
Livesay E.
Ljiljana Pasa-Tolic
McClerren A. L.
Medema M. H.
Meindl K.
Mingxun Wang
Mohimani H.
Mohimani H.
Nguyen D. D.
Nuno Bandeira
Pavel A. Pevzner
Perkins D. N.
Pevzner P. A.
Pevzner P. A.
Pieter C. Dorrestein
Roland D. Kersten
Samuel O. Purvine
Si Wu
Tsur D.
Ueda K.
Velásquez J. E.
Völler G. H.
Völler G. H.
Warren A. S.
Watrous J.
Wei-Ting Liu
Willey J. M.
Winter J. M.
Zerikly M.
Publication venue: 'American Chemical Society (ACS)'
Publication date
Field of study

Crossref

A Genome-Wide Aberrant RNA Splicing in Patients with Acute Myeloid Leukemia Identifies Novel Potential Disease Markers and Therapeutic Targets

Author: Ast
Benjamin Haibe-Kains
Daniel J. Deangelo
David P. Steensma
Edward A. Fox
Fridman
Gabriela Motyckova
Herve Avet-Loiseau
Ibiayi Dagogo-Jack
Ilene Galinsky
James D. Griffin
John Burke
John Quackenbush
Laurence Lode
Martha Wadleigh
Michal Bar-Natan
Patrick M. Pilarski
Richard Stone
Samuel Pevzner
Sigitas Verselis
Sophia Adamia
Publication venue: 'American Association for Cancer Research (AACR)'
Publication date
Field of study

Crossref

An inter‐species protein–protein interaction network across vast evolutionary distance

In cellular systems, biophysical interactions between macromolecules underlie a complex web of functional interactions. How biophysical and functional networks are coordinated, whether all biophysical interactions correspond to functional interactions, and how such biophysical‐versus‐functional network coordination is shaped by evolutionary forces are all largely unanswered questions. Here, we investigate these questions using an “inter‐interactome” approach. We systematically probed the yeast and human proteomes for interactions between proteins from these two species and functionally characterized the resulting inter‐interactome network. After a billion years of evolutionary divergence, the yeast and human proteomes are still capable of forming a biophysical network with properties that resemble those of intra‐species networks. Although substantially reduced relative to intra‐species networks, the levels of functional overlap in the yeast–human inter‐interactome network uncover significant remnants of co‐functionality widely preserved in the two proteomes beyond human–yeast homologs. Our data support evolutionary selection against biophysical interactions between proteins with little or no co‐functionality. Such non‐functional interactions, however, represent a reservoir from which nascent functional interactions may arise

DSpace@MIT

Crossref

Harvard University - DASH

PubMed Central

Open Repository and Bibliography - Liège

UPF Digital Repository

Interpreting cancer genomes using systematic host network perturbations by tumour virus proteins

Author: Abderazzaq Fieda
Adelmant Guillaume
Askenazi Manor
Byrdsong Danielle
Calderwood Michael A.
Carvunis Anne-Ruxandra
Chen Alyce A.
Cheng Jingwei
Correll Mick
Deo Rahul C.
Dricot Amélie
Duarte Melissa
Fan Changyu
Feltkamp Mariet C.
Ficarro Scott B.
Franchi Rachel
Garg Brijesh K.
Grace Miranda
Gulbahce Natali
Hao Tong
Holthaus Amy M.
James Robert
Korkhin Anna
Litovchick Larisa
Mar Jessica C.
Padi Megha
Pak Theodore R.
Pevzner Samuel J.
Rolland Thomas
Rozenblatt-Rosen Orit
Tavares Maria
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 26/07/2012
Field of study

Genotypic differences greatly influence susceptibility and resistance to disease. Understanding genotype-phenotype relationships requires that phenotypes be viewed as manifestations of network properties, rather than simply as the result of individual genomic variations. Genome sequencing efforts have identified numerous germline mutations, and large numbers of somatic genomic alterations, associated with a predisposition to cancer. However, it remains difficult to distinguish background, or 'passenger', cancer mutations from causal, or 'driver', mutations in these data sets. Human viruses intrinsically depend on their host cell during the course of infection and can elicit pathological phenotypes similar to those arising from mutations. Here we test the hypothesis that genomic variations and tumour viruses may cause cancer through related mechanisms, by systematically examining host interactome and transcriptome network perturbations caused by DNA tumour virus proteins. The resulting integrated viral perturbation data reflects rewiring of the host cell networks, and highlights pathways, such as Notch signalling and apoptosis, that go awry in cancer. We show that systematic analyses of host targets of viral proteins can identify cancer genes with a success rate on a par with their identification through functional genomics and large-scale cataloguing of tumour mutations. Together, these complementary approaches increase the specificity of cancer gene identification. Combining systems-level studies of pathogen-encoded gene products with genomic approaches will facilitate the prioritization of cancer-causing driver genes to advance the understanding of the genetic basis of human cancer

University of Queensland eSpace

Widespread Expansion of Protein Interaction Capabilities by Alternative Splicing.

While alternative splicing is known to diversify the functional characteristics of some genes, the extent to which protein isoforms globally contribute to functional complexity on a proteomic scale remains unknown. To address this systematically, we cloned full-length open reading frames of alternatively spliced transcripts for a large number of human genes and used protein-protein interaction profiling to functionally compare hundreds of protein isoform pairs. The majority of isoform pairs share less than 50% of their interactions. In the global context of interactome network maps, alternative isoforms tend to behave like distinct proteins rather than minor variants of each other. Interaction partners specific to alternative isoforms tend to be expressed in a highly tissue-specific manner and belong to distinct functional modules. Our strategy, applicable to other functional characteristics, reveals a widespread expansion of protein interaction capabilities through alternative splicing and suggests that many alternative "isoforms" are functionally divergent (i.e., "functional alloforms")

PubMed Central

Open Repository and Bibliography - Liège